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Abstract: Phylogenetic networks are rooted acyclic directed graphs in 
which the leaves are identified with members of a set X of species. The cluster 
of a vertex is the set of leaves that are descendants of the vertex. A network is 
“distinct-cluster” if distinct vertices have distinct clusters. This paper focuses 
on the set DC(X) of distinct-cluster networks whose leaves are identified with 
the members of X. For a fixed X , a metric on DC(X) is defined. There is a 
“cluster-preserving” simplification process by which vertices or certain arcs may 
be removed without changing the clusters of any remaining vertices. Many of 
the resulting networks may be uniquely determined without regard to the order 
of the simplifying operations. 
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1 Introduction 

It is common in biology to describe evolutionary history by means of a phylo¬ 
genetic tree T. (See the book [2l] for many details.) In such a tree, the leaf 
set corresponds to a set X of taxa on which measurements such as on DNA can 
be made. Internal vertices correspond to ancestral species. Branching corre¬ 
sponds to speciation events by some isolation mechanism. Typically the trees 
are assumed to be rooted in the distant past, perhaps by means of an outgroup. 

Recently, the roles of hybridization and lateral gene transfer have been seen 
to be important; see m, i. 0 , m- Models of such events sometimes lead 
to rooted acyclic networks which, unlike trees, allow branches to recombine. 
Overviews for such networks are found in m m, ubi , m- 
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The number of possible rooted acyclic networks with a given leaf set X is 
infinite. Some researchers have focused attention on networks with additional 
properties. Such classes include networks that are regular [2j, normal [24], tree- 
child [7], galled trees 033 , or level -k |22) . 

Formal definitions are given in Section 2. This introduction will give a rough 
idea of some of the critical concepts. 

If TV is a finite acyclic rooted network with leaf set A', for each vertex v 
the cluster cl(v;N) of v (or cliv) if N is clear from the context) is the set of 
x £ X which are descendants of v along any directed path in N. Here we utilize 
the “hardwired” clusters in the sense of m ■ The clusters are important for 
interpreting biological networks. For example, in a biological network cl(y ) is 
the set of extant species whose genome possibly contains mutations originating 
in the ancestral species v. An example of a network N is shown in Figure 1 
with X = {1,2, 3,4}. In N, cl(8) = {1,2,3} and cl(3) = {3}. 



Figure 1: A distinct-cluster phylogenetic network N. 


A network N with vertex set V is distinct-cluster (DC) provided that when¬ 
ever v and w are distinct vertices, then cl(v) ^ cl(w). Thus distinct vertices 
have different clusters. Let DC(X) denote the set of distinct-cluster acyclic 
rooted networks with leaf set X up to isomorphism. The network in Figure 1 
lies in DC(X). 

This paper studies DC(X ), which includes all trees, regular networks, and 
normal networks with leaf set X. 

It is useful to have a quantitative measure of the difference between two 
networks with the same leaf set X. In comparing two trees, there are several 
metrics, including the Robinson-Foulds metric |20j . the nearest neighbor inter¬ 
change metric EE EE the triples distance [8], the SPR distance PQ, and the 
matching distance [13]. 

Generalizing to networks that are not necessarily trees, [2: gives a metric 
between two regular networks. The path-multiplicity distance or /z-distance is 
a metric between two tree-child networks [7] and also between two networks 
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that are tree-sibling and also time-consistent [3]. The tripartition distance [6] 
is a metric on rooted phylogenetic networks that are tree-child and also time- 
consistent. The nested-labels distance [5] is a metric on tree-child networks. 

This paper defines a metric on DC{X). If N\ and N 2 are in DC(X ), the 
inheritance metric D{N\,N 2 ) between them is defined in Section 4. The main 
tool is a matrix H (or H(N) to specify the network) for N £ DC{X) such that 
for each pair of vertices u and v. H U ^ V (N) is the number of distinct directed 
paths in N from u to v. Since N is DC, we may identify u and v with their 
clusters. Now suppose Ni and N 2 are in DC(X). Suppose that vertices u\ and 
Vi of Ni and vertices u 2 and v 2 of N 2 satisfy that cl(ui;Ni) = cl(u 2 ;N 2 ) and 
cl(vi\Ni) = cl(v 2 ; N 2 ). Then u\ and u 2 may be identified because they have the 
same cluster and both networks are DC. Similarly vi and v 2 may be identified. 
A direct comparison is now possible between H UltVl (N\) and H U2jV2 (N 2 ) since 
both count the number of directed paths between u\ = u 2 and v\ = v 2 in 
their respective networks. The term \H UliVl (Ni) — H U2>Vo (N 2 )\ contributes to 
D(N l7 N 2 ). 

It is interesting that the matrix H includes the information utilized in [7] to 
produce a metric on tree-child networks. More precisely, if N is both DC and 
tree-child, then the vectors p(v) utilized in jjj have the entries H v x for x £ X. 

Even DC networks can be very large. The number of vertices in a member 
of DC(X ) can grow exponentially with |X|, bounded by the number of distinct 
nonempty subsets of X. Huge networks can be difficult to interpret, and it 
may be useful to “simplify” a network into one with fewer vertices and arcs 
that is potentially easier to understand and which summarizes certain aspects 
of the original network. For example, one might ask for a natural procedure to 
simplify a huge network into a tree or into a tree-child network. Alternatively, 
one might ask which trees or normal networks N 2 “best fit” a given network Ni 
in the sense of minimizing D(Ni, N 2 ). 

This paper describes one simplification process: If N is in DC{X) 1 a cluster¬ 
preserving simplification (CPS) of N is a member of DC(X) obtained recursively 
by removing a vertex or an arc from N while taking care that each remaining 
vertex has its cluster unchanged (preserved). In Section 5 we give definitions of 
the basic operations (defined so as not to change the clusters). 

In Section 6 we prove that to a large extent the order of the operations is 
immaterial. An arc (u, v) is redundant if there is a directed path from u to v 
other than the path along the arc (u,v) itself. For example, in Figure 1, (9,3) 
is redundant. A key result is Theorem 7.3, which asserts that when V' is a 
subset of V(N) containing the root r and each member of X , then there exists 
a unique N’ £ DC(X ) such that V(N’) = V', N' has no redundant arcs, and 
N' is a CPS of N. 

In particular, Theorem 7.3 says that the order of the operations in forming 
the CPS does not matter when the CPS has no redundant arcs. Thus each 
member of a large family of CPS of N depends only on its vertex set. Moreover, 
Theorem 7.4 asserts that for two such subsets V' and V", if the unique networks 
are N' and N" respectively, then N" is a CPS of N' iff V" C V’. An example 
is presented in Section 8, while some extensions are described in Section 9. 
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2 Basic notions 


A directed graph (V, E) is a set V of vertices and a set E C V x V where each 
(a, b) £ E is called an arc. We interpret (a, b) £ E as a line segment directed 
from a to b. We assume there is no arc (a, a), so that there are no loops. Since 
E is a set, not a multiset, there are no multiple arcs. 

If a and b are vertices, a directed path from a to b is a sequence a = 
vq, ■ ■ ■ , Vk = b in V such that for * = 0, • • • , k — 1, there is an arc (u;, Uj+i) £ E. 
The length of the path is k. There is always a directed path of length 0 from a to 
a. An arc (a, b) is redundant if there exists a directed path a = vq, V\, ■ ■ ■ , Vk = b 
with k > 1 (so that the path is not the same as the path consisting of the single 
arc (a, b)). 

If q and c are vertices then c is a child of q and q is a parent of c iff there 
exists an arc ( q , c) £ E. A child c of q is a tree-child iff q is the only parent of 
c. A vertex is hybrid if it has more than one parent. The out-degree of a vertex 
v is the number of children of v, while the in-degree of a vertex v is the number 
of parents of v. A vertex is a leaf if it has no child. 

Let X be a set (for example, of biological species). If L is the set of leaves, 
we shall assume there is a bijection if : X —> L. Usually we will identify x £ X 
with ip(x) £ L, so if x £ X we may write x £ L. Let V(X ) denote the set of all 
nonempty subsets of X. 

A directed graph (V, E) is rooted with root r if there exists a vertex r such 

that for every v £ V there is a directed path from r to v. In Figure 1, N is 

rooted with root r = 9. 

A directed graph is acyclic if there is no directed cycle; i.e., there is no vertex 
v with a directed path from v to v of length greater than 0. If the network is 
both rooted and acyclic, then the root is necessarily unique. 

If N = (V, E) is acyclic, write u < v iff there is a directed path in N from 

w to n. Since N is acyclic it follows that if u < v and v < u then u = v. 

Transitivity is obvious, so < is a partial order on V. If u < v and a/t), write 
u < v. 

An X-cluster is a nonempty subset of X. If v is a vertex, then the cluster 
cl(v) of v (or d(v;N ) if we need to specify the network N) is d(v) = {x £ 
X : v < a’}. It is thus an A'-cluster containing the leaves x £ X such that 
there exists a directed path from v to x (of any length). Note that cl(v) is the 
“hardwired” cluster of v in the sense of [T2]. Since there are no directed cycles, 
a directed path of maximum length starting at v must end at a leaf, so that 
d(v) is nonempty. If x £ X, then cl(x) = {x} via the directed path from x to 
x of length 0. In Figure 1, cl( 6) = {2,3} and cl( 9) = {1,2, 3,4}. 

A cluster is trivial if it has the form X or {.t} where x £ X. Let Tr(X) de¬ 
note the set of trivial clusters for the set X. In Figure 1, Tr(X) = {{1}, {2}, {3}, 
{4},{1,2,3,4}}. 

It is easy to see that if u < v then d(v) C d(u). 

A network is distinct-cluster (DC) if no two vertices have the same cluster; 
i.e., whenever m/d then cl(u) ^ cl(v). If the network is DC, then each vertex 
that is not a leaf has out-degree at least 2. To see this, note that if v has child 
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ci then cZ(ci) C cl(v), so there exists x £ cl(v) — cl(ci) and there must exist a 
child C2 of v such that x £ cl{c 2). The network in Figure 1 is distinct-cluster. 

We summarize the properties we shall require in the definition of an A- 
network: An X-network N is N = (V, E,r, X) where (V,E) is a finite acyclic 
directed network with root r and leaf set A'. We may write V(N) for V or 
E(N) for E. Two A'-networks Aq and N 2 are isomorphic if there is a bijection 
: V(Ni) —» V(N 2 ) such that (<j)(u), £ E(N 2 ) iff (u,v) £ E(Ni) and such 

that the labels of the leaves are preserved. A DC X-network is an A-network 
that is DC. 

There are several special kinds of networks that are of interest. Let N = 
(V, E,r, X) be an A-network. Then N is a tree iff no vertex is hybrid. N 
is tree-child [7] iff every vertex v that is not a leaf has a child c that is tree- 
child. N is normal [24] iff N is tree-child, no vertex has outdegree 1, and in 
addition N has no redundant arc. N is regular [2] iff (1) it is DC and (2) there 
exists an arc (u,v) iff both cl(v) C cl(u) and there is no vertex w such that 
cl(y) C cl(w) C cl(u). 

Every tree, every regular network, and every normal network is DC. There 
exist tree-child networks which are not in DC{X) (for example, Figure 2 of 
IZ!)- Figure 1 is DC. It is not tree-child and not normal since 6 has only hybrid 
children. It is not regular since it contains the redundant arc (9,3). 


3 The inheritance matrix 

Let N = (V, E, r, X) be an A-network (not necessarily DC). Let the vertices be 
«!,■■■ ,v m in some order. The adjacency matrix A of A is the m x m matrix 


A, 


1 if there is an arc ( v j, Vj ) in E 
0 otherwise 


Note that A encodes all the structure of N. Given A, the leaves are those 
Vi such that for all j, Aij = 0. There is an arc ( Vi,Vj ) iff Ajj > 0. 

It is well known (see, for example, [IT]) that for all k > 0, ( A k )ij is the 
number of directed paths of length k from Vi to Vj . 

In this section we define a matrix H , called the inheritance matrix , which 
will be of use in defining a metric on the set of DC A-networks. An example is 
given in Section 8. 

Since N is finite and acyclic, there is a directed path of maximum length. If 
the maximum length is L, there are no paths of length L + 1. 


Lemma 3.1. Let the maximum length of a directed path in N be L. Then 
A k = 0 for k > L + 1. 

Proof. If k > L + 1, then N has no directed path of length k. Hence A k = 0. □ 


Let L be the maximum length of a directed path. Define 


H = I+ A + A 2 -1-b A L . 
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By Lemma 3.1, it is equivalent to write 


H = I + ^AL 
j =i 

Call H the inheritance matrix (from the “h” in “inheritance”). If we need 
to specify the network N, we may also write H(N) instead of H. 

Theorem 3.2. For all vertices v, H VtV = 1. 

Proof. Let v be a vertex. Since N is acyclic, no k > 0 satisfies A(, v > 0. It 
follows that H VtV = I v>v = 1. □ 

Theorem 3.3. For all vertices u and v, H u>v is the number of directed paths 
in N from u to v of any nonnegative length. 

Proof. By the definition, H UtV = I u>v + Y^jL\{ A3 )u,v For each j > 0, (A J ) U , V 
is the number of directed paths of length j from u to v. The trivial path u of 
length 0 from utow corresponds to the term J UjU . □ 

Theorem 3.4. Pi is invertible, and I — A = H~ x . 

Proof. Let L be the maximum length of a directed path. Then A L+1 = 0 and 
(/ — A)H = (J — A)(I + A + A 2 + A 3 + ■ ■ ■ + A L ) = I — A L+1 [by telescoping] 
= 1 . □ 

Corollary 3.5. H = (J — A)~ 1 and A = I — H -1 . 

Corollary 3.6. N can be reconstructed given either A or H. 

Proof. N can be reconstructed from A since A Uj „ > 0 iff there is an arc (u,v). 
But by Corollary 3.5, A can be found from FI. Hence N can be reconstructed 
from H as well. □ 

Theorem 3.7. Suppose N has A and Fd as above. Then 

(i) H -d = Add = HA. 

(ii) df u yd y and u has children ci, ■ ■ ■ , c*, then H U}V = ^ i=i H CuV . 

(in) df u yd v and v has parents q±, ■ ■ ■ ,qu then H u ^ v = Hu.q, ■ 

Proof, (i) H — d = A + A 2 + • • • + A L where A L+1 = 0 

= A + A 2 H-b A L + A L+1 = A{d + A H-b A L ) = AH. A similar argument 

applies for HA. 

(ii) From (i), H — d = AH. Since u ^ v, H UtV = (H — d) u>v = ( AH) UtV = 
Y) A UiW H w ^ v where w ranges over the vertices of N. But if A UiW > 0, w can 
only have values Cj, and A U Ci = 1. 

(iii) From (i), H — d = HA. Since u ^ v, H UtV = ( H — d) UyV = ( HA) U)V = 
H u w A w „ where w ranges over the vertices of N. But if A WiV > 0, then w 

can only have values qi and A quV = 1. □ 

Note that (ii) says that each path from u to v must go through an initial arc 

( u,Ci ). Similarly (iii) says that each path from u to v must go through a final 

arc (qi,v). 
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4 The inheritance metric on DC X-networks 


Fix X and let DC(X) denote the set of isomorphism classes of DC X-networks. 
We regard each member of DC{X) as a rooted acyclic directed X-network 
N = (V, E, r, X) that is distinct-cluster. Two members of DC{X ) are considered 
equal if and only if they are isomorphic. 

Let N = (V. E.r, X) and N' = (V 1 , E',r',X) be in DC(X). Let A and A' 
be the adjacency matrices of N and N' respectively, and let H and H' be the 
inheritance matrices of N and N' respectively. Since N is DC, we may identify 
each vertex u with its cluster d(u;N ), so that H u>v is defined if u and v are 
clusters from X that occur as clusters of vertices of N. In particular if u and v 
are clusters in both N and N', then both H v v and H' u v make sense. Moreover 
both r and r' are identified with the cluster cl(r; N) = X while each leaf x £ X 
is identified with the singleton set {x}. 

Let S be the set of all clusters of N and S' the set of all clusters of N'. Let 
C denote a subset of V(X) that contains both S and S'; thus C is a collection 
of nonempty subsets of X containing S U S'. Define the inheritance matrix qH 
of N over C as follows if u and v are in C : 


cHu,v 


H UtV if both u and v are in S 
0 otherwise 


Note that if u £ C but u £ S then cH u ,u = 0. 

Similarly we define the inheritance matrix cH' of N' over C. Define 

C D(N,N') = ^ [| C H U , V - cKJ : u £ C,v £ C . 

The following lemma says that cD(N, N') does not depend on the choice of 
C. We thus obtain the same number using different choices of C. 

Lemma 4.1. Let N and N' be DC X-networks with cluster sets S and S' 
respectively. Let C and C’ be sets of X-clusters that contain both S and S'. 
Then C D(N,N') = C ’D{N,N'). 

Proof. The only nonzero terms | cH u , v — cH' u v \ or \c'H u ,v — C'H' U V \ in the 
definitions occur when (a) both u and v are in S or (b) both u and v are in S'. 
In either case, \cH u , v ~ cH' u v \ = \c'H UtV - c'H' u v \. □ 

Define D(N,N') to be the common value cD(N, N') for any choice of C 
containing both S and S'. We shall see below that D(N,N') is a metric, so 
we will call D(N, N') the inheritance metric or inheritance distance between N 
and N'. Most commonly one will utilize C = S U S', but different choices may 
be convenient when one is comparing several different networks. 

Roughly, D(N,N') counts the sum, over all u and v, of the absolute value 
of the difference between the number of directed paths from u to v in N and 
the number of directed paths from u to v in N'. 
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Theorem 4.2. Fix X. Then D defines a metric on DC(X). In particular for 
any DC X-networks N, N', and N" we have 

(i) D(N, N 1 ) > 0. 

(ii) D(N, N') = D(N',N). 

(in) D{N,N') < D{N,N") + D(N",N'). 

(iv) D(N, N') = 0 iff N = N'. 

Proof. Pick any C that contains both S and S'. It is immediate that qD{N, N') > 
0 and that qD(N, N r ) =c D(N',N). If N" is a third X-network with cluster 
set S" and C contains S, S ', and S", then when u and v range over members 
of C we have 

cD(N,N') = J2\ c H u , v - cKJ 

= 2 I cH u , v - cH'u v + cH” v - cH' u v \ 

< Y, [I cH U)V - cH" >v I + I cH” v - cH' u v |] 

= I cH u , v - cKA + I C'Hu, v - cH' u v | 

= c D(N,N”)+ C D(N",N’), 
proving the triangle inequality (iii). 

There remains only to show that cD(N , N 1 ) = 0 iff N = N'. It is immediate 
that cD{N,N) = 0. Conversely, suppose cD{N,N') = 0. If |X| = 1, then the 
network with only a single vertex is the only DC X-network, whence N = N'. 
Thus we may assume |X| > 1. 

Since qD(N, N') = 0 it follows that for each u and v in C, \ cH u , v - 
cH' u v \ = 0 so cH u ,v = cH' u v . In particular for each u, cH u>u = cH' u u . But 
■u G S' iff c’Hu.u = 1 and u € S' iff cH' u u = 1 by Theorem 3.3. Hence S = S', 
so N and N' have the same vertex sets. 

If both a and v are in S = S', then since cH UiV = cH' uv if follows that 
H UiV = H' u v . Thus H = H'. By Corollary 3.5, A = I - H - 1 = I - H’~ x = A'. 
It follows that N = N'. □ 

The following Theorem 4.3 tells the distance between two DC X-networks 
chosen to be especially distant from each other. 

The trivial tree Tr(X) is the tree whose clusters are all the trivial clusters 
Tr(X) with one arc (X, x) for each x G X. The regular network of all nonempty 
subsets of X is V(X) with vertex set V{X) and an arc {A, B) iff B C A and 
there is no C G V{X) such that B C C C A. 

Theorem 4.3. Let N and N' be DC X-networks, where |X| = n. Then 

(1) D(N,N') is an integer. 

(2) Suppose Tr(X) is the trivial tree on X and V{X) is the regular X-network 
of all nonempty subsets of X. Then for n > 2, 

D(V(X), Tr(X)) = J2 (?) E CO W - 2n ~ 1- 

k=1 ^ ' j=l ' 

Proof. (1) Since both cH u , v and cH' u v are integers, (1) is immediate. 

(2) We first observe that in V{X), if A C B and \B\ — \A\ = k, then the 
number of paths from B to A in N is k\. To see this, list the elements of B — A 



as 1, • • •, k. Each arc has as its endpoint a subset containing one fewer member 
of X than the subset at the starting point of the arc. Denote the path by 
B = So, S 4 , S 2 , - ■ ■ Sk = A. But there are k ways to choose Si (deleting one 
member), then (fc — 1) ways to choose S 2 , then (fc — 2) ways to choose S 3 , etc. 
The total number of paths is then k\. 

Now we find Hb,a for all possible B and A. If the set B has size k then 
the number of ways to choose B is (^), where 1 < k < n. If B has been chosen, 
then the number of ways to choose A is (*) , where j = |A|, and 1 < j < k. 
The number of paths from B to A is then {k — j)\. Hence the total number of 
paths is 

k =1 v 7 i=l V7/ 

Note that Tr(X) has one arc (X, x ) for each x € X. Hence H u . v (Tr(X )) = 0 
unless u = r and v £ X or else u = v = r, or else u = x = v for some x £ X. 
In any of these cases H u v (Tr(X)) = 1 . Hence the total sum of the entries is 
n + l + n = 2n + l. 

The difference yields (2) since in any case when H u , v (Tr(X)) = 1 then 
H u . v {V{X)) >1. ‘ □ 

If B n = D{V{X),Tr(X)) with |X| = n, then B 2 = 0, B 3 = 15, B 4 = 94, 
B 5 = 535, Bq = 3287. I conjecture that for any n > 2, D(V(X),Tr(X)) is the 
maximum value of any D(N,N'). 


5 Cluster-preserving simplifications 

Let N = (V, E,r, X) be a DC V-network. We consider two simplifying steps: 

(1) Suppose v £ V, v £ X, v ^ r. We delete vertex v by passing through to 
form a new network N' as follows: Let qi, ■ ■ ■ ,qk denote all the parents of v, 
and let Ci, • • • , c m denote all the children of v. Remove v and all arcs involving 
v from E. Add new arcs ( q j, Cj) for * = 1, • • • , k\ j = 1, • • - , m. (If one already 
exists, then we retain just the single copy.) Let N' denote the result. Thus 
N 1 = ( V',E',r,X) where V' = V-{v}, E' = [E-{(q i ,v),(v,c j )}]u{{q i ,c j )}. 
Alternatively we will denote the result as N' = D(v)N , meaning deletion of v 
from N. 

(2) Let (a, b) £ E be a redundant arc in N. Form N' = (V,E',r,X) where 
E’ = E — {(a, b)}. Thus N' is N with arc (a, b) removed. Alternatively we will 
denote the result as D(a, b)N , meaning deletion of the redundant arc (a, b) from 

N. 

Figure 2 shows a DC X-network N with X = {1, 2, 3,4} and redundant arc 
(10,4). The networks D(6)N and D(10,4)N are also shown. Vertices continue 
to be labeled with the same labels of N when they have the same clusters. Note 
that D(6)N contains the redundant arcs (8,2) and (9,3) which were not present 
in N. In general, deleting a vertex can introduce new redundant arcs. 
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Figure 2: N with D(6)N and I?(10,4)7V. 


Theorem 5.1. Let N = (V,E,r,X) be a DC X-network. Let v be a vertex, 
v r, v ^ X. Let (a, b ) be a redundant arc of N. 

(1) For vertices u and w distinct from v, there is a directed path from u to w in 
D(v)N iff there is a directed path in N from u to w. 

(2) D(v)N is a DC X-network. 

(3) For every vertex w in N if w v, cl(w; D(v)N) = cl(w;N). 

(4) For vertices u and w, there is a directed path from u to w in D(a,b)N iff 
there is a directed path in N from u to w. 

(5) D(a,b)N is a DC X-network. 

(6) For every vertex w in N, cl( w; D(a,b)N) = cl(w;N). 

(7) Suppose u and w are vertices of N other than v. Then u < w in N iff u < w 
in D(v)N. 

(8) Suppose u and w are veHices of N. Then u < w in N iffu < w in D(a , b)N. 

Proof. If N' = D(v)N , then ( V',E') is a directed graph. By the restriction on 
the choice of v, it follows that V' contains r and each member of X. 

We first show (1). Suppose u and w are vertices of N distinct from v. First 
suppose that there is a directed path from u to w in N. If u = t>o, Vi, ■ ■ ■ , Vk = w 
is a directed path in N from u to w, then the same path is a directed path from 
u to w in N' if no vertex Vi equals v. If, on the other hand Vj = v for some 
0 <j<k then Vj -1 is a parent of v and Vj +1 is a child of v. Hence the directed 
path u = Vo, Vi, ■ ■ ■ , Vj- 1 , Vj+i, ■ ■ ■ ,Vk = w is a directed path from u to w in 
N'. 

Conversely let u = vq, v\, ■ ■ ■ , Vk = w be a directed path in N'. The only 
way this could fail to be a path in N is if, for some j, {vj- 1 , Vj) is not an arc of 
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N. By construction this means that Vj-\ is a parent of v and Vj is a child of v. 
Hence u = Vo, v\, ■ ■ ■ , Vj, ■ ■ ■ , Vk = w is a directed path from u to w in 

N. This proves (1). 

Since N was acyclic, it follows that N' is acyclic because a cycle starting 
and ending at w in N' would imply a cycle in N as well. (3) follows from (1) 
since for x € X, x € cl(u; N ) iff there is a directed path in N from u to x, and 
similarly for N'. But then N' is distinct-cluster since N was. In addition r 
remains a root of N' since for each vertex w there remains a path from r to w. 
Thus (2) is true. 

Now let N 1 = D(a,b)N. Thus N' = (V,E r ) where E' = E — {(a, b)}. Note 
that (V, E') is a directed graph. 

To prove (4), let u and w be vertices of N. Note that any directed path in 
N' is also a directed path in N. Conversely, suppose u = v o, IT, • • • , v n = w is 
a path in N from u to w. If there is no j such that (a, b) = (yj,Vj+ 1 ), then it 
remains a path from u to v in N'. If, however, a = Vj, b = Vj + 1 , then since 
(a, b) is redundant we may choose a path a = uo,- ■ ■ , u m = b from a to b in N 
with m > 1 that does not include the arc (a, b). Then u = vq, • • ■ , Vj = a = 
Uo, Mi, • • • , u m = b = Vj+ 1 , Vj+ 2 , • • • , v n = w is a path in N' from u to w. This 
proves (4). 

Since N was acyclic, by (4) N' is acyclic. Note that r remains a root of N' 
since for every vertex v, r < v in N, whence r < v in N 1 . Moreover if u (f X , 
then cl(u; N) = {x € X : u < x in N} = {x £ X : u < x in N'} = cl(u; N 1 ). 
This proves (5) and (6). 

(7) and (8) are restatements of (1) and (4) respectively. □ 

Let N and N' be DC X-networks. We call N' a cluster-preserving simplifi¬ 
cation (CPS) of N provided there exists a sequence N = N 0 , ■ ■ ■ ,Nk = N' of 
DC X-networks such that for i = 0, • • • , k — 1, either X i+1 = D(v)Ni for some 
v € V(Ni) or N i+ 1 = D(a, b)Ni for some redundant arc (a, b) G E(Ni). 

Theorem 5.2. Let N be a DC X-network and N' be a CPS of N. Then 

(1) N' is a DC X-network. 

(2) For each v € V(N'), there exists a unique vertex <f>{v) € V(N) such that 
cl{(l>{v)-,N) = cl{v,N'). 

(3) If (u, v) is an arc of N', then there is a directed path from f>(u ) to <f>(v) in 
N. 

(4) If u and v are in V(N') and there is a directed path from to <p(v) in 
N, then there is a directed path from u to v in N'. 

(5) Suppose u and w are vertices of N'. Then 4>{u) < <f{w) in N iff u < w in N'. 


Proof. Let N = Nq, ■ ■ • , Nk = N' be a sequence of DC X-networks such that for 
i = 0, • • • , k — 1, either X, ;+1 = D(u)X, ; for some v € V(N,) or X i+1 = D(a, b)Ni 
for some redundant arc (a, b) € E(Ni). 

(1) is immediate from Theorem 5.1. 

For (2) given 1 < j < k, define (j>j : V(Nj) —> V(Nj-i) by ( pj(w ) = the 
unique vertex in V{Nj_ i) such that cl(</>j(w); Nj- 1 ) = cl(w;Nj). Define (j> : 
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V(N') —* V(N) by <j>{v) = f>\ o <^ 2 ° • • • o cj> k . By Theorem 5.1, cl((j)(v)-, N ) = 
cl(v;N'). Since N is DC, there exists at most one vertex with a given cluster, 
proving uniqueness. 

(3) and (4) follow from (7) and (8) of Theorem 5.1. (5) follows from (3) and 

(4). □ 

Observe that by Theorem 5.2(2), each vertex in N' has the same cluster 
as the corresponding vertex in N. This justifies the name “cluster-preserving.” 
Moreover, directed paths from u to v in N' correspond to directed paths from 
<j>(u) to 4>(v) in N. Thus much essential information about N is preserved in 
N'. 

We will generally identify a vertex v in N' with the vertex <j>(v) in IV, so we 
may say that each vertex of a CPS N' of N is also a vertex of N. 

The number of vertices of N' could be considerably smaller than the number 
of vertices of N. In some cases the number of arcs in N' could be greater than 
the number of arcs in N. 

Theorem 5.3. Suppose that the DC X-network N is a tree. Then every CPS 
of N is also a tree. 

Proof. Let the CPS N' of N be given by N 1 = D k ■ • ■ DiN, where each D, has 
form either D(v) or D[a, b). We prove the theorem by induction on k. If k = 0, 
the result is immediate. Assume the result is true for k, and we will prove the 
result for k+1. Now let the CPS N' of N be given by N' = D k+ i ■ ■ ■ D\N. We 
prove that N' is a tree. 

Since N is a tree, it has no redundant arc, so there must be a vertex v of N 
such that D\ = D{v). Since v ^ r , v has at least one parent, and since N is a 
tree, v has at most one parent. Let q denote the unique parent of v. Since v £ X, 
v has at least one child. But if v had only one child c then d(v; N) = d(c; N), 
contradicting that N is distinct-cluster. Hence v has children ci, C 2 , • • • , c m for 
some m > 2. In D(v)N vertex v and all arcs incident with v are deleted, while 
there are new arcs (q, c,) for i = 1, • • • ,m. Each c, had a unique parent v in N 
and now has a unique parent q in N'. Each vertex of N 1 other than the root or 
Ci has the same unique parent as in N. Hence D(v)N has no hybrid vertex and 
D(v)N = DiN is a tree. Write M = DiN. Now N' = D k+ i ■ ■ ■ D 2 M. Since 
there are k factors and M is a tree, it follows that N' is a tree by the inductive 
hypothesis. □ 

6 Modifying the description of a CPS 

A CPS N' of N can be written in the form N' = D k ■ ■ ■ D\N where, letting 
Ni = Di - ■ ■ D\N, each D,; has form either D(v) for v a vertex of A^j_i or 
D(a,b) for (a, b) a redundant arc of Ni_i. Note that the redundant arc (a, b) 
might not have been present in N but rather have been introduced via some Dj 
with j < i — 1. In this section we show that this description may be changed 
in various ways. For example D(v)D(w)N = D(w)D(v)N if both w and v are 
vertices of N and v ^ w. 
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Theorem 6.1. Let N be a DC X-network. Suppose v is a vertex and (a,b) is 
a redundant arc of N. 

(i) If w is a vertex and w ^ v, then D(v)D(w)N = D(w)D(v)N. 

(ii) Assume » / a and » / t. Suppose further that it is false that (a, v ) and 
(v,b ) are both arcs in N. Then D(v)D(a,b)N = D(a,b)D(v)N. 

(Hi) Assume v ^ a and v ^ b. Assume that (a,v) and (v,b) are arcs in N. 
Then D(v)D(a , b)N = D(v)N. 

(iv) If v = a , write q±, ■ • • ,qk for the parents of a. Then 

k 

D(a)D(a, b)N = [ D(q u b)] D(a)N. 

2=1 

(v) If v = b, write ci, • • • , Ck for the children of b. Then 

k 

D(b)D(a, b)N = [ D(a, Cj )] D(b)N. 

3 =1 

Proof, (i) Case 1. Suppose neither v nor w is a parent of the other. Let 
qi, ■ ■ ■ , qu be the parents of v and ci, ■ ■ ■ ,c n the children of v. Let ri, • • • , n be 
the parents of w and d\, ■ ■ ■ , d r „ the children of w. By the assumptions, w is 
not any qi nor cp and v is not any r, or di. 

Then D(v)N has new arcs (qi, Cj). Neither of the vertices of such an arc is 
w. Now delete w. In D(w)D(v)N we get new arcs ( ri,dj ) since v is not any r, 
nor di. 

A similar argument shows that we obtain the same arcs in D(v)D(w)N. 
Case 2. Suppose one vertex is a parent of the other. Without loss of gener¬ 
ality, assume v is a parent of w. 

Let q \, ■ ■ ■ ,qk be the parents of v and c\, ■ ■ ■ ,c n ,w the children of v. Let 
v,r i, • • • , r'i be the parents of w and d\, ■ ■ ■ , d m the children of w. 

First we remove v. The new arcs of D(v)N are ( qi,Cj ) and ( qi,w ). Next 
we remove w. The arcs ( qt,w ), (r^tu), and ( w,di ) are removed while the arcs 
( qpCj ) remain. The new arcs in D{w)D{v)N are thus ( ri,dj ) and ( qi,Cj). 

Alternatively we first remove w from N. The new arcs of D(w)N are ( r*, dj) 
and ( y,dj). Next we remove v. The arcs ( v,dj ) are removed while the arcs 
( Ti,dj ) remain. The new arcs of D(v)D(w)N are thus ( qi,Cj) and (). 

The vertices and arcs of D(v)D(w)N and D(w)D(v)N are the same, so the 
conclusion follows. 

(ii) Let v have parents qi, q 2 , ■ ■ ■ , qk and children ci, • ■ ■ , c m . D(a, b)N has 
all the arcs of N except (a, b). Then D(v)D(a,b)N removes v and all arcs 
involving v and adds new arcs ( qi,Cj ). Note that no arc ( qi,Cj ) is the same as 
(a, b ) unless a = qi for some i and b = Cj for some j, (in which case (a, b) is 
present again in D(v)D(a,b)N). But this case is excluded by hypothesis. 

Similarly, D(v)N removes v and all arcs involving v. It adds new arcs (qi, Cj ) 
(if not already present). The arc (a, b) is still present. It remains redundant 
unless the only path in N from a to b that avoids (a, b) was a,v,b in which case 
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a is a parent of v and b is a child of v: this case is excluded by hypothesis. Next 
D(a,b)D(v)N deletes (a, b). 

Thus D(a, b)D(v)N = D(v)D(a , b)N since their vertices and arcs are the 
same. 

(iii) Let v have parents a, qi,q 2 ,'" iQk and children b,c i,--- ,c m . Then 
D(a, b)N has all the arcs of N except (a, b). Next D(v)D(a, b)N removes v and 
all arcs involving v and adds new arcs (a, 6), ( a,Cj ), (qi,b) and ( qi,Cj). Note 
that (a,b) is still present in D(v)D(a,b)N. 

Similarly, D(v)N removes v and all arcs involving v. It adds new arcs (a, Cj ), 

( q t ,b ), (. qi,Cj ) (if not already present). The arc (a, b) is still present. We see 
that D(v)N = D(v)D(a, b)N since their vertices and arcs are the same. 

(iv) Let the children of v = a in N be Ci, • • • , c m , b. Then D(a , b)N has the 
same arcs as N except that (a, b) is missing. Hence D(a)D(a,b)N has all arcs 
of N removed that involve a, and it has new arcs (q t , Cj ). 

On the other hand D(a)N has all arcs involving a removed and new arcs 
( q%,Cj ) and (qi,b). 

Note that in N, a has a child d so d < b since (a, b) was redundant in N. 
Hence (qi,b) is redundant since there is a path g* < d < b. Thus D(qi,b) is a 
well-defined operation that deletes (qi,b). 

It follows that [rii=i D(qi,b)]D(a)N has the same arcs as N except that 
all arcs involving a have been removed, and there are new arcs (■ qi,Cj ). The 
network is thus the same as D{a)D(a , b)N. 

(v) Suppose that in N, the parents of b are a, qi,--- ,q m ■ Then D(a,b)N 
is the same as N except that (a, b) is removed. The parents of b are now only 
qi, ■ ■ ■ , 5 ^. Then D(b)D(a, b)N has all arcs involving b removed, plus new arcs 

On the other hand, D(b)N has all the arcs of N involving b removed and 
has new arcs ( qi,Cj ) and (a,Cj). Note that ( a,Cj ) is redundant in D(b)N since 
in N there is a child d of a and a path a < d < b < Cj. If we apply D(a , Cj ) then 
all that changes is that ( a,Cj ) is removed. Thus [ UU c j )] D(b)N has all 
the arcs of N involving b removed and with new arcs (q u Cj). The network thus 
agrees with D(b)D(a , b)N. □ 

Corollary 6.2. Let N' be a CPS of N with vertex set V(N) — W, where W is 
a subset ofV(N). Let W = {w\,W 2 , ■ ■ ■ ,Wk} (in any order). Then N' can be 
written in the form RD(wk) • ■ ■ D(u>i)N where R is a product of D(at, bi) for 
redundant arcs (a*, bi ) of D(wk) • ■ ■ D(wi)N. 

Proof. By hypothesis, N' can be written as a composition D m ■ ■ • D\N where 
each Di is of form either D(wj) or D(a , b). But by Theorem 6.1, each D(wj)D(a : b) 
can be rewritten as SD(wj) where S' is a product of D(a.i, bi). In this way all the 
D(wj ) can be moved to the right so that there are no deletions of a redundant 
arc to the right of any D(wj). □ 
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7 Invariant properties 

This section proves the following partial converse to Theorem 5.2 and considers 
its consequences. 

Theorem 7.1. Let N and N' be DC X-networks such that V(N') C V(N). 
Assume 

(i) Whenever u < v in N', then u < v in N. 

(ii) Whenever u and v are in V(N') and u < v in N, then u < v in N'. 
Suppose N' contains no redundant arc. Then N' is a CPS of N. 

Thus conditions (i) and (ii) together with the absence of redundant arcs 
imply that N' is a CPS, obtained from N by a composition of operations of 
form D(v) and D(a,b). 

Since the networks are DC, the condition V(N') C V(N) can be rephrased 
by saying that there exists (j> : V(N') —> V(N) such that, for all v € V(N'), 
cl(v;N') = cl(<f)(v)-, N). 

Proof. Let W = V(N) — V(N'). Write W = {wi,w 2 ,--- ,Wk} in some fixed 
order. There is a composition S = D(a m , b m ) ■ ■ ■ D(ai, b\) of deletions of re¬ 
dundant arcs (a*,6j) of N such that N\ := SN contains no redundant arcs. 
Let N 2 = D(wk) ■ ■ ■ D(w\)Ni. There is then a composition U of deletion of 
redundant arcs such that N 3 = UN 2 contains no redundant arcs. Note that N 3 
is a CPS of N by definition. 

We claim that N' = N 3 . By the choice of W, N' and N 3 have the same 
vertex set. There remains to show that the arcs of N' and N 3 are the same. 

Let (u,v) be an arc of N'. By (i) u < v in N. Since N 3 is a CPS of N, it 
follows u < v in N 3 by Theorem 5.2. If (u,v) is not an arc of N 3 , then there 
exists a vertex e of N 3 such that u < e < v in N 3 . Note e is a vertex of N' as 
well by the choice of W. Since N 3 is a CPS it follows u < e < v in N, whence by 
(ii) u < e < v in TV'. It follows that (u,v) is redundant in N', a contradiction. 
This shows that (u,v) is an arc of N 3 . 

Now assume that (u,v) is an arc of N 3 . Since N 3 is a CPS of N, it follows 
that u < v in N. By (ii), it follows that u < v in N'. We claim (u,v) is an arc 
of N'. If not, then there exists a vertex e of N' such that u < e < v in N'. By 
(i), u < e < v in TV. Since N 3 is a CPS of N it follows that u < e < v in N 3 . 
Hence (u,v) is redundant in N 3 , a contradiction. This shows that (u,v) is an 
arc of N 1 . □ 

Lemma 7.2. Suppose N and N' are DC X-networks with the same vertex set 
V. Assume 

(i) Whenever u < v in N' then u < v in N. 

(ii) Whenever u < v in N then u < v in N'. 

Suppose neither N nor N' has a redundant arc. Then N = N'. 

Proof. The vertex sets are the same. We show that the arcs are the same. 

Suppose (a, b) is an arc of N'. Then a < b in N' so a < b in N. We claim 
that (a, b) is an arc of N. If not then there exists c such that a < c < b in N. 
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It follows that a < c < b in N' as well, whence (a, b) is a redundant arc of N', 
a contradiction. Hence (a, b) is an arc of N. 

A symmetric argument shows that if (a, b) is an arc of N, then (a, b) is an arc 
of N'. It follows that N = N' since they have the same vertices and arcs. □ 

Write X = {xi,x 2 ,--- > x n}- Let Tr(X) denote the set of trivial clusters, 
Tr{X) = {X, {xi}, {x 2 }, • • • , {x n }}. 

Theorem 7.3. Let N = (V. E,r,X) be a DC X-network. Let V’ be a subset of 
V containing Tr{X). There exists a unique DC X-network N' such that 

(1) V(N , ) = V I . 

(2) N' has no redundant arcs. 

(3) N' is a CPS of N. 

Proof. Let U = V — V’. The members of U are the vertices that must be 
deleted from N to obtain the vertices of N'. Write U = {ui, u 2 , • • • , Uk} in some 
fixed order. There is a composition Si = D(a mi b m ) • • • D(a\, b\) of deletion of 
redundant arcs such that iVi := S\N contains no redundant arcs. Let _/V 2 := 
D(uk) ■ ■ • D(ui)Ni. There is then a composition S 2 of deletions of redundant 
arcs such that N 3 = S 2 _/V 2 contains no redundant arcs. Note that N 3 is a CPS 
of N by definition and V(N 3 ) = V'. This proves the existence. 

For the uniqueness, let N' be another such X-network. If u < v in N 3 , then 
u < v in N, so u < v in N 1 . Similarly if u < v in N', then u < v in N, so 
u < v in N 3 . Since neither N 3 nor N 1 has redundant arcs, Lemma 7.2 implies 
N 1 = N 3 . □ 

The unique N 1 of Theorem 7.3 will be denoted N(W ), where W = V(N') — 
Tr. Note that the members of W are the nontrivial clusters of N'. Thus N(W) 
is the unique CPS of N with no redundant arcs and with vertex set WUTr. We 
may also denote it by N(wi, • • • ,Wk) where the elements of W are Wi, ■ ■ ■ ,Wk- 

Theorem 7.4. Let N = (V,E,r,X) be a DC X-network. Let W and W' be 
subsets ofV disjoint from Tr. Then NfW 1 ) is a CPS of N(W) iffW 1 C W. 

Proof. If N(W') is a CPS of N(W), then 

Tr U W' = V(N(W')) C V(N(W)) = W U Tr. 

Since both W and W' are disjoint from Tr, it follows W’ C W. 

Conversely, suppose W' C W. Let V\, ■ ■ ■ , Vf :: be a listing of V — W’ such 
that V \, ■ • • , Vj are the members of V — W while Vj+ 1 , • • • ,Vk are the members of 
W — W’. There exists a composition R of deletions of redundant arcs such that 
N(W) = RD{vj) ■ ■ ■ D(vi)N. Let Ni = D(vk) • • • D(vj+i)N(W). This network 
has vertex set Tr U W'. There is a composition S of deletions of redundant arcs 
such that SNi has no redundant arcs. By Theorem 7.3, SNi = N(W'). Hence 
N{W’) is a CPS of N(W). □ 

An X-network N' is displayed by an X-network N if N' is obtained by 
deleting arcs of N and possibly contracting arcs (a, b) if a has out-degree one. 
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Figure 3: A DC X-network N with tree N( 5) that is a CPS of N and also a 
tree T that is displayed by N but is not a CPS of N. 


It is important to recognize that, for example, having a tree T displayed by a 
DC network N is not the same as having the tree T be a CPS of N. Figure 3 
shows a network N and a CPS N( 5) of N which is displayed by N. It also shows 
a tree T that is displayed by N but that is not a CPS. One recognizes that T is 
not a CPS since cl(5;T) = {1,2}, which is not the cluster of any vertex of N. 

Suppose N has no redundant arcs. Suppose N' satisfies the hypotheses of 
Theorem 7.1 except that N' contains a redundant arc. Then N' need not be a 
CPS of N. To see this, consider the network N in Figure 4, which is a rooted 
tree. The accompanying network N' satisfies the hypotheses of Theorem 7.1 
except that there is a redundant arc. But every CPS of a tree is a tree by 
Theorem 5.3, so N' is not a CPS of N. Hence Theorem 7.1 is not true without 
the assumption that there are no redundant arcs. 

The next result shows that the example is typical. 

Theorem 7.5. Let N and N 1 be DC X-networks such that V(N') C V(N). 
Assume 

(i) Whenever u < v in N', then u < v in N. 

(ii) Whenever u and v are in V(N') and u < v in N, then u < v in N'. 

Then there exists a CPS N" = (V",E",r,X) of N with no redundant arcs and 
a collection of pairs (a*, bf) for i = 1, • • • , k where ai and bi lie in V" , a* < 
in N", (Oj,6j) ^ E", such that N' is obtained by adjoining arcs {ai,bf) to E". 

Proof. Let (a±, b±), ■ ■ ■ , (a*,, bk) denote the redundant arcs of N'. 

Let N" = D(ak,bk) ■ ■ ■ D(ai,bi)N'. Then N" has no redundant arcs. By 
Theorem 5.2, N" satisfies 
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Figure 4: N' is not a CPS of N 


(i) Whenever u < v in N" , then u < v in N. 

(ii) Whenever u and v are in V(N") and u < v in N, then u < v in N". 

Hence by Theorem 7.1, N" is a CPS of N. It follows that N’ is obtained by 

adjoining the arcs (a*, bf) back to the CPS N". □ 


8 An example 


In this section we compute the inheritance metric in an example and also find 
some cluster-preserving simplifications of an initial network N £ DC(X). For 
this example, we are also able to compute CPS trees that “best fit” the initial 
network N. The idea is that an initial N £ DC{X ) could be very complicated 
and the following problem then arises: 

Problem. Given N £ DC(X), let T(N) be the collection of CPS of N which 
are trees. Find To £ T{N) such that To minimizes D(N,T ) for T £ T(N). 

We call a solution to the problem a best fitting CPS tree for N. 

More generally, given a subset S C DC(X), we might seek a member S 0 £ S 
such that So minimizes D(N, S) for S £ S. 

Here we begin the example. Let N be the network in Figure 1. Note that 
X = {1, 2,3,4} and N is DC. List the 9 vertices in their order 1, 2, • • •, 9. Then 
the adjacency matrix A is the matrix 


A = 


0 0 0 
0 0 0 
0 0 0 
0 0 0 
1 1 0 
0 1 1 
0 0 1 
0 0 0 
0 0 1 


0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
10 0 
0 1 1 
0 0 0 


0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
1 1 0 
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The longest path has length 3, 

so 

H = 

/+A+A 2 +A 3 . Hence the inheritance 

matrix H = H(N) is 

' 1 

0 

0 

0 

0 

0 

0 

0 

0 ' 



0 

1 

0 

0 

0 

0 

0 

0 

0 



0 

0 

1 

0 

0 

0 

0 

0 

0 



0 

0 

0 

1 

0 

0 

0 

0 

0 


H = 

1 

1 

0 

0 

1 

0 

0 

0 

0 



0 

1 

1 

0 

0 

1 

0 

0 

0 



0 

0 

1 

1 

0 

0 

1 

0 

0 



1 

2 

1 

0 

1 

1 

0 

1 

0 



1 

2 

3 

1 

1 

1 

1 

1 

1 



As a check, note that Hgg = 3 since there are 3 directed paths from 9 to 3. 

Suppose one sought examples of CPS of N that are trees. Consider, for 
example, Ti = D(9,3)D(7)D(8,2)D(6)D(9,3)N, shown in Figure 5. Vertices 
have the same labels in T\ as in N. This is possible since the vertices are 
identified with their clusters. Since T\ has no redundant arcs and nontrivial 
vertices 5 and 8, T\ = AT(5,8) by Theorem 7.3. 



Figure 5: The best fitting CPS trees Ti = N{ 5,8) and T 2 = N( 6,8) for N in 
Figure 1. 


To compute the distance D(N,Ti), let C consist of all clusters of N, identi¬ 
fied with the vertices of N. Note that T\ does not contain 6 or 7. Then 


cH(T i ) 


10 0 
0 1 0 
0 0 1 
0 0 0 
1 1 0 
0 0 0 
0 0 0 
1 1 1 
1 1 1 


0 0 0 0 0 0 
0 0 0 0 0 0 
0 0 0 0 0 0 
1 0 0 0 0 0 
0 1 0 0 0 0 
0 0 0 0 0 0 
0 0 0 0 0 0 
0 10 0 10 
110 0 11 


From qH(N) = H(N ) and cH(T\), it is easy to compute D(N,Ti ) = 13. 
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Recall that Tr(X) denotes the set of trivial clusters. In our example, when 
we identify the clusters with their corresponding vertices, Tr(X) = {1,2, 3,4,9}. 

Suppose we seek the best fitting CPS trees of N by exhaustive search. Since 
there are n = 4 leaves, there are 15 binary rooted trees and 11 non-binary 
rooted trees, hence a total of 26 possible rooted trees. Any rooted tree contains 
no redundant arc. By Theorem 7.3 if it is a CPS, it must have the form N(S) 
for a subset S of {5, 6, 7, 8}; hence there are at most 2 4 = 16 networks to check. 
Moreover, since a rooted tree with 4 leaves can have at most 7 vertices, in our 
search S may contain at most 2 vertices not in Tr(X); this eliminates 5 of the 
16 networks. Table 1 shows the 11 possibilities and their distances from N. 


CPS N' 

D(N, N') 

JV(0) 

23 

N( 5) 

19 

N(6) 

19 

N( 7) 

19 

N( 8) 

18 

N( 5,6) 

14 not a tree 

N( 5,7) 

15 

N( 5,8) 

13 

N( 6,7) 

14 not a tree 

N( 6,8) 

13 

N( 7,8) 

13 not a tree 


Table 1: The 11 CPS of N with two or fewer nontrivial vertices. 

Exhaustive search in this case shows that there is a tie for the best fitting 
CPS tree of N between T\ = N{ 5,8) and T 2 = N( 6,8), both shown in Figure 
5. Both have distance 13 from N. Note that of the 26 rooted trees, only 8 are 
CPS of N. For example, the tree T 3 with nontrivial clusters {3,4} and {2,3,4} 
is not a CPS of N; this is obvious since {2,3,4} is a cluster of T 3 but not of N. 
Since T 3 is DC we may nevertheless compute D(N,T 3 ) = 25. By Theorem 7.5, 
N( 6) is a CPS of N( 6 , 7) but not of iV(5,7,8). 

9 Extensions 

(I) Hybrid vertices with out-degree 1 

Some papers (such as [T7], [T5], [7]) require that in a network every hybrid 
vertex v has out-degree 1. If c is the unique child of v, then cl(v) = cl(c ), and the 
network cannot be distinct-cluster. The results in this paper may nevertheless 
apply to such networks as follows. 

A hybrid out-degree-1 X-network N is N = (V,E, r, X) where (V,E) is a 
finite acyclic directed network with root r and leaf set X which satisfies that 
every hybrid vertex v has out-degree 1. Let H a i(X) denote the collection of 
hybrid out-degree-1 X-networks. Given N £ H a i(X), form a new network 
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as the result of contracting each edge between a hybrid vertex and its unique 
child. More specifically, given a hybrid vertex v with parents gi, <72 , ■ ■ • ,qu and 
a unique child c let V' = V — {v} and E' = [E — {(gi, v), ■ ■ ■ , (qu, v), (v, c)}] U 
{(<7i> c )j • •' , (g fc ,c)}. Thus the hybrid vertex has been replaced by its former 
child, which is now hybrid with in-degree k. No member of X is deleted, so that 
(V', E') is easily seen to be a finite acyclic directed network with root r and leaf 
set X. It is possible that a hybrid vertex is now a leaf. If this procedure is used 
recursively until there are no more hybrid vertices with out-degree 1 , we obtain 
a network we shall call the derivative of N with non-unit hybrid out-degree and 
denote H a ^ (N). The order of the various deletions do not affect the resulting 
network. 

Conversely, suppose N = (V,E,r,X) satisfies that (V,E) is a finite acyclic 
directed network with root r and leaf set X but no hybrid vertex has out-degree 
1. We may construct a new network in which every hybrid vertex has out- 
degree 1 by reversing the previous procedure. More specifically, if v is hybrid 
with parents qi,--- ,Qk and out-degree either 0 or greater than 1 , we insert a 
new vertex w, remove the arcs (gi, u), • • • , (g*,, v) and add new arcs (w, v ) and 
(qi,w),--- ,(qk,w). Note that now w is hybrid with out-degree 1 because it 
has a unique child v. If v £ X then v remains in X after the construction. 
We perform this procedure until all hybrid vertices have out-degree 1. Call the 
result the derivative of N with unit hybrid out-degree and denote it H 0 i(N). 

It is straightforward to see that if N € H 0 i(X), then H 0 \{H 0 ^\{N)) is iso¬ 
morphic with N. The argument uses that there is a one-to-one correspondence 
between hybrid vertices of N and hybrid vertices of while every hybrid 

vertex of N has out-degree 1. 

A network N £ H 0 i(X) is extended distinct-cluster if is distinct- 

cluster. Roughly, N is extended distinct-cluster when distinct vertices have 
different clusters, except that a hybrid and its unique child are allowed to have 
the same cluster. Let DC 0 \{X) denote the set of extended distinct-cluster net¬ 
works. For N\ and N 2 in DC 0 i{X) define 

D o1 (N u N 2 ) = D(H 0 ^ 

The definition makes sense since H 0 ^\(N\) and H 0 ^\{N 2 ) are in DC(X). 
Theorem 9.1. D a 1 is a metric on DC 0 i(X). 

Proof. Suppose D ol (Ni, N 2 ) = 0. By definition D(H 0 ^i(Ni), H 0 ^ 1 (N 2 )) = 0, 
so that H 0= £\(Ni) = H OJ ti(N 2 ) because I? is a metric on DC(X). But then 
Ni = H 0 i(H 0 ^i(Ni)) = H 0 i(H 0 ^i(N 2 )) = N 2 . The other properties of a 
metric are immediate. □ 


(II) Use of p-norms 

The construction of the metric D in section 4 can be easily generalized. For 
p > 1 let ||.||p denote the p-norm for an m x m matrix M regarded as being in 
R m “. Thus 


\\M\\ p = Y\\M U ^ : 1 < u,v < to] 
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Let Ni and N 2 be in DC(X). Let C be a set of X-clusters containing all 
clusters of N and N' . Our definition of the inheritance distance between N and 
N' is equivalent to cD(N,N') := \\cH — cH'\\\ and D{N,N') = cD(N,N') 
for any such C. By analogy we may define cD p (N, N') := || C H — cH’Wp and 
then define the p-norm inheritance distance between N and N' by D p (N , N') = 
cD p (N , N') for any such C. By an argument like that of Lemma 4.1, the result 
is the same for any C containing the clusters of both N and N' . Note that 
D P (N,N') will be the p-th root of an integer. In the example of Section 8, 

D 2 (N,N(5,8)) = Vl5. 

Acknowledgments. We wish to thank the anonymous referees whose rec¬ 
ommendations provided substantial improvements to this paper. 
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